         Date Source  Site ID   POC Daily Mean PM2.5 Concentration    Units
       <char> <char>    <int> <int>                          <num>   <char>
1: 01/05/2002    AQS 60010007     1                           25.1 ug/m3 LC
2: 01/06/2002    AQS 60010007     1                           31.6 ug/m3 LC
3: 01/08/2002    AQS 60010007     1                           21.4 ug/m3 LC
4: 01/11/2002    AQS 60010007     1                           25.9 ug/m3 LC
5: 01/14/2002    AQS 60010007     1                           34.5 ug/m3 LC
6: 01/17/2002    AQS 60010007     1                           41.0 ug/m3 LC
   Daily AQI Value Local Site Name Daily Obs Count Percent Complete
             <int>          <char>           <int>            <num>
1:              81       Livermore               1              100
2:              93       Livermore               1              100
3:              74       Livermore               1              100
4:              82       Livermore               1              100
5:              98       Livermore               1              100
6:             115       Livermore               1              100
   AQS Parameter Code AQS Parameter Description Method Code
                <int>                    <char>       <int>
1:              88101  PM2.5 - Local Conditions         120
2:              88101  PM2.5 - Local Conditions         120
3:              88101  PM2.5 - Local Conditions         120
4:              88101  PM2.5 - Local Conditions         120
5:              88101  PM2.5 - Local Conditions         120
6:              88101  PM2.5 - Local Conditions         120
                      Method Description CBSA Code
                                  <char>     <int>
1: Andersen RAAS2.5-300 PM2.5 SEQ w/WINS     41860
2: Andersen RAAS2.5-300 PM2.5 SEQ w/WINS     41860
3: Andersen RAAS2.5-300 PM2.5 SEQ w/WINS     41860
4: Andersen RAAS2.5-300 PM2.5 SEQ w/WINS     41860
5: Andersen RAAS2.5-300 PM2.5 SEQ w/WINS     41860
6: Andersen RAAS2.5-300 PM2.5 SEQ w/WINS     41860
                           CBSA Name State FIPS Code      State
                              <char>           <int>     <char>
1: San Francisco-Oakland-Hayward, CA               6 California
2: San Francisco-Oakland-Hayward, CA               6 California
3: San Francisco-Oakland-Hayward, CA               6 California
4: San Francisco-Oakland-Hayward, CA               6 California
5: San Francisco-Oakland-Hayward, CA               6 California
6: San Francisco-Oakland-Hayward, CA               6 California
   County FIPS Code  County Site Latitude Site Longitude  Year
              <int>  <char>         <num>          <num> <num>
1:                1 Alameda      37.68753      -121.7842  2002
2:                1 Alameda      37.68753      -121.7842  2002
3:                1 Alameda      37.68753      -121.7842  2002
4:                1 Alameda      37.68753      -121.7842  2002
5:                1 Alameda      37.68753      -121.7842  2002
6:                1 Alameda      37.68753      -121.7842  2002
# Shorten the verbose AQS column names
setnames(combined_data, old = "Daily Mean PM2.5 Concentration", new = "PM2.5")
setnames(combined_data, old = "Site Latitude", new = "Latitude")
setnames(combined_data, old = "Site Longitude", new = "Longitude")
setnames(combined_data, old = "Local Site Name", new = "Site")
setnames(combined_data, old = "Daily Obs Count", new = "Obs.Count")
setnames(combined_data, old = "Daily AQI Value", new = "AQI")
# A tibble: 1 × 5
State mean_PM2.5 median_PM2.5 sd_PM2.5 count
<chr> <dbl> <dbl> <dbl> <int>
1 California 10.1 7.6 9.82 75732
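The code that produced the summary above is not shown. A minimal sketch of how such a per-state summary might be computed with dplyr follows. It assumes the `State` and `PM2.5` columns exist after the renaming step; the toy `combined_data` below is a stand-in built from four of the Livermore rows printed earlier, and `pm_summary` is a hypothetical name.

```r
library(dplyr)

# Toy stand-in for combined_data (assumption: the real table has these columns).
combined_data <- data.frame(
  State = rep("California", 4),
  PM2.5 = c(25.1, 31.6, 21.4, 25.9)
)

# One row per state with mean, median, standard deviation, and row count.
pm_summary <- combined_data %>%
  group_by(State) %>%
  summarise(
    mean_PM2.5   = mean(PM2.5, na.rm = TRUE),
    median_PM2.5 = median(PM2.5, na.rm = TRUE),
    sd_PM2.5     = sd(PM2.5, na.rm = TRUE),
    count        = n()
  )

print(pm_summary)
```

Here `n()` counts all rows in each group, including any with a missing `PM2.5`, while `na.rm = TRUE` excludes missing values from the other statistics.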
ggplot(combined_data, aes(x = State, y = PM2.5, fill = State)) +
  geom_boxplot() +
  labs(title = "PM2.5 Levels by State", x = "State", y = "PM2.5") +
  theme_minimal()
ggplot(combined_data, aes(x = PM2.5, fill = County)) +
  geom_histogram(bins = 20, alpha = 0.6, position = "identity") +
  facet_wrap(~ County) +
  labs(title = "Distribution of PM2.5 Levels by County", x = "PM2.5", y = "Frequency") +
  theme_minimal()
# linewidth replaces the size aesthetic for lines (size is deprecated for lines since ggplot2 3.4.0)
ggplot(la_sites, aes(x = Date, y = PM2.5, color = Site)) +
  geom_line(linewidth = 1) +
  geom_point(size = 2) +
  labs(title = "PM2.5 Levels Over Time by Site in Los Angeles", x = "Date", y = "PM2.5") +
  theme_minimal()
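The Los Angeles plot relies on `la_sites`, which is not defined in this excerpt. One plausible way to build it is sketched below. It assumes that `la_sites` is the subset of `combined_data` with `County == "Los Angeles"`, and that `Date` (printed earlier as `<char>` in `MM/DD/YYYY` form) should be parsed so the x-axis is a real time axis. The Los Angeles rows in the toy data are made-up placeholders, not measurements.

```r
library(dplyr)

# Toy stand-in for combined_data. The Alameda rows echo the printed sample;
# the Los Angeles rows are hypothetical values for illustration only.
combined_data <- data.frame(
  Date   = c("01/05/2002", "01/06/2002", "01/05/2002", "01/06/2002"),
  County = c("Alameda", "Alameda", "Los Angeles", "Los Angeles"),
  Site   = c("Livermore", "Livermore", "Hypothetical Site A", "Hypothetical Site A"),
  PM2.5  = c(25.1, 31.6, 20.0, 30.0)
)

# Keep only Los Angeles County and parse the character dates.
la_sites <- combined_data %>%
  filter(County == "Los Angeles") %>%
  mutate(Date = as.Date(Date, format = "%m/%d/%Y"))

str(la_sites)
```

With `Date` stored as a `Date` rather than a character vector, ggplot2 treats the x-axis as continuous and `geom_line()` can connect each site's observations in time order.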